rapidly as possible the imagined letters for some physical property
such as vertical height. This procedure allowed for a preliminary
assessment of the temporal parameters of visual imagination. The
results delineate a number of visual image phenomena: (1) visual
image sequencing and formation is guided by implicit speech when long
unpronounceable letter strings are used; (2) when short pronounceable
words are used, the entire string can be simultaneously represented
in visual imagination, and no implicit verbal control is required; (3)
the capacity of the visual image system is very limited, between
three and five letters, for short easily pronounceable letter strings;
and (4) visual image and visual percept representations of words are
comparable for very short pronounceable letter strings, but as the
length of the string increases visual image capacity for simultaneous
representation is soon exceeded. (Author/WR)
ABSTRACT



In a series of six experiments, subjects visually imagined letters
or words and then classified as rapidly as possible the imagined letters
for some physical property such as vertical height. This simple procedure
allowed for a preliminary assessment of the temporal parameters
of visual imagination. The results delineate a number of visual image
phenomena:

(1) Visual image sequencing and formation is guided by implicit
speech when long unpronounceable letter strings (alphabet) are used.

(2) When short pronounceable words are used, the entire string can
be simultaneously represented in visual imagination, and no implicit
verbal control is required.

(3) The capacity of the visual image system is very limited,
between three and five letters for short easily pronounceable letter
strings (words).

(4) Visual image and visual percept representations of words are
comparable for very short pronounceable letter strings (three letter
words), but as the length of the string increases visual image capacity
for simultaneous representation is soon exceeded. When this occurs,
sequencing to the next set of images again comes under implicit verbal
control. In contrast, when visual percept capacity is exceeded by a
physical string of letters, sequencing can occur through simple scanning
or eye movement.
INTRODUCTION



Subjective experience is once again becoming increasingly important
to psychology. This is shown by the interest of the layman (especially
students) in such things as mystical experiences, drug experiences,
rock festival experiences, and sensitivity training. Each of these
things is in part a reflection of current interest in subjective phenomena.
Such interest is not confined to the layman. To take but four
general examples, there is a recent book edited by Tart entitled
Altered States of Consciousness (1969), which deals with topics such as
hallucinations, dreams, and meditation in terms of their subjective
states. A second book edited by Jacobs and Sachs, The Psychology of
Private Events (1971), is concerned with a variety of covert states.
And two other books, Mental Imagery by Richardson (1969) and Mental
Imagery in the Child by Piaget and Inhelder (1971), are concerned
exclusively with the topic of imagery. As indicated in several of these
books, as well as in numerous articles, one common approach to subjective
states is to relate their presence or absence to the presence or absence
of physiological indicators. This approach would be even more useful
(1) if we could be assured that subjects were in fact having the subjective
experiences they claim (i.e., eliminating the possibility that
they sometimes respond on the basis of demand characteristics) and
(2) if we could describe subjective experiences in a richer language
than that of simple occurrence and nonoccurrence. For example, it would
be of interest to describe subjective experience in terms of certain
parameters, such as rate of occurrence. If in fact objective parametric
description is possible, subjective phenomena can be profitably studied
independently of physiological indicators.

Such an approach is now possible with relatively simple forms of
nearly universal subjective experience such as memory imagery (Richardson,
1969), the topic of this proposal. A general definition of memory
imagery would be difficult, but it certainly would be a self-initiated,
nearly universal, sensory-like experience in the absence of correlated
external stimulation (Weber & Bach, 1969; Weber & Castleman, 1970).
According to Richardson (1969) memory images would also be under voluntary
control, and the content of memory images would correspond to the
content of past sensory experience. Such a definition is quite broad
but would exclude: (1) eidetic imagery (Haber & Haber, 1964), because
it is rare at best, (2) after images (Craik, 1940), because they are not
self-initiated, and (3) iconic storage (Sperling, 1960, 1967; Neisser,
1967), also because it is not self-initiated.

The occurrence of memory imagery, that is, subjective sense-like experience,
can be grounded in several theoretical views. For example, Paivio
(1969) considers visual images to be conditioned sensations. A word
like "car" would act as the conditioned stimulus for a host of conditioned
responses in the form of visual images of cars with various
colors and shapes.

Another view, that of Neisser (1967), holds that perception and
imagery are the result of "analysis-by-synthesis." In this view percepts
and images are both constructive acts. Percepts are the result of
central operations matching or synthesizing the sensory input. The
actions of the synthesizing then determine the subjective experience and
memory representation obtained from a given stimulus. That is, percepts
are not copies of a stimulus; they are synthetic constructions,
activated in order to match the physical stimulus representation. In
the same way images are the result of the synthesizing operations being
activated in their own right, that is, without an externally present
stimulus as would be the case with percepts.

Both of these views are perhaps of heuristic value, but at present
they do not seem to provide much deductive mileage. In my view what
is now needed is smaller-scale theorizing about selected imagery
phenomena, without attempting to explain it all in one theory. If this
is the case, then we need to study the basic properties of imagery
(e.g., its temporal parameters).

In this proposal temporal aspects of visual memory imagery for
letters and words are considered. The visual memory imagery examined
here is nearly universal, as determined from subjective reports. In a
number of the writer's classes over the last two years the incidence of
visual imagery (in the sense of being able to imagine, with eyes closed,
the spatial forms of successive letters of the alphabet) is about 90
percent (Weber & Castleman, 1970). Contrary to widespread impression,
this figure is very close to that obtained by Galton (1880), if we
include his subjects with relatively weak imagery (this seems to be
reasonable, because even weak imagery should be sufficient to imagine
a common form like a letter).



There have been a number of recent approaches to the study of
visual imagery. A few pertinent examples follow. Posner et al. (1969)
in their Experiment III have evidence based on reaction times that
auditorily presented letter names can be translated into a visual
representation prior to matching (on a Yes/No basis) a subsequently
presented printed letter. Paivio (1969) reviews a number of his studies
indicating that learning in various tasks shows predictable relations
to independently scaled imagery values of word stimuli. Segal and
Gordon (1969) have shown that the instruction to form an image (visual
or auditory) affects the subsequent detectability of perceptually
presented signals (visual or auditory). When image and percept are in
the same modality, detection is less proficient than when they are in
different modalities, possibly due to mutual interference or noise when
both tasks appear in the same modality. The work of Brooks (1967, 1968)
is particularly impressive for its use of objective indicators. For
example, Brooks has used block letters of the form: *F. The
asterisk would represent a starting position (upper left hand corner)
and the subject's task when presented orally with the letter name "F"
is to generate a visual image of the block letter, scan that image
clockwise, and indicate Yes/No for each successive corner of the form
whether it is an extreme corner (top or bottom) or an intermediate
corner. In this case the appropriate responses would be yes, yes, no,
no, no, no, no, no, yes, yes. On the basis of various conditions Brooks
is able to separate two distinct memory image modes, visual and verbal.
Two recent studies by the writer have emphasized the measurement
of visual imagery rates with letters. The first study (Weber & Bach,
1969) used subjective procedures. In the visual imagery condition
subjects were asked to visualize the successive letters of the alphabet,
with their eyes closed, as they would appear one at a time flashed on
an imaginary movie screen. The rate obtained was an average of about
2.5 letters per second, with considerable variability. This rate was
in contrast to implicit and explicit speech rates, both of which are
about 6 letters per second (Landauer, 1962). While these results were
highly suggestive in pointing to separate imagery systems for visual
and verbal representations, the variability associated with measuring
visual imagery rates underlined the need for more objective procedures.
Weber and Castleman (1970) found an interesting objective procedure for
dealing with visually imagined characters. In their Experiment 2
lowercase printed letters of the alphabet were partitioned into two
subsets, vertically large letters (b, d, f, g, h, j...y) and those
that were not vertically large (a, c, e, i, m...z). Subjects were
instructed to visually imagine such letters, in alphabetic sequence,
and classify them for their vertical height. This task insured that
the subject process each letter for its spatial property of vertical
size. Subjects in fact reported using visual images of the letters in
order to determine whether they were large or small characters.
Experiment 3 was a control to determine whether subjects could have
rotely learned, during the experiment, the appropriate classification
response for each individual letter. The results were strongly contrary
to such an interpretation. The possibility that vertical height
properties were mediated by a verbal process acquired prior to the
experiment did not accord with the subjective reports of visual images
being generated nor with other evidence concerning fatigue ratings.
With the visual property condition, a pure measure of the time
required to generate visual images was not obtained. In addition to the
process of generation, the subject had to abstract from the generated
image its height property and respond appropriately; and this would
[...] the visual property condition seemed
[...] at a visual image is and for establishing
a minimum standard of clarity (the image must be clear enough to
determine whether it is a large letter). In Experiment 2 when the
visual property condition was combined with the subjective procedure
of Weber and Bach (1969) the rate of generation for the subjective
condition (less than 2 letters per second) was even slower than previously
found by Weber and Bach. In addition, the shaping of the
imagery process by the objective visual property condition was quite
effective in reducing variability for the subjective procedure.

In the studies that follow, subjects are instructed to imagine
letters of the alphabet and to describe for each imagined letter a related
objective spatial visual property, for example, to classify it
as a vertically large or small letter. The principal response measure
is time to process a letter string.

Experiments 1, 2, and 2': Verbal Control of Visual Imagery?
(This series of experiments has now been published with the following
reference: Weber, R. J., Kelley, J., and Little. Is visual image
sequencing under verbal control? Journal of Experimental Psychology,
1972, 96, 354-362. Experiment 2' was not part of the original proposal
but has been added for completeness.)



EXPERIMENTS 1, 2, AND 2' 



Abstract 



Three experiments were conducted to examine the conjecture of
R. J. Weber and J. Castleman in 1970 that sequencing between
visual images of letters is sometimes under the control of
verbal imagery (implicit speech) in the sense of saying a
letter to oneself before visualizing it. Verbal control of
visual image sequencing was investigated in alphabetic and
word letter strings. Results with alphabetic letter strings
were consistent with a verbal control process. However, results
with word letter strings did not provide evidence for
verbal control of sequencing, presumably because the name of
a short word can serve to represent its several letters as a
spatially parallel chunk in visual imagination. It is concluded
that the visual imagery system has a limited capacity
for spatially parallel representation. When this limited
capacity is exceeded, as in alphabetic strings, there is
implicit verbal control over visual image sequencing.



It has been suggested that nonverbal imagery (visual imagery would
be an example) and verbal processes (verbal imagery in the form of
implicit speech would be an example) are our two primary modes of
symbolic representation (Bower, 1970; Paivio, 1971). In addition, one
of the important problems of future research in cognitive psychology
is seen as determining how visual imagery and verbal processes are
related to one another (Bower, 1970; Paivio, 1971). In this paper
interrelations of visual imagery and verbal imagery (implicit verbal
processes) are examined. By "visual imagery" we mean the ability to
deal directly with memorial representations of spatial information, and
by "verbal imagery" we mean the use of memorial representations of
linguistically related information in the form of implicit speech
(Weber & Castleman, 1970). Both imageries may be accompanied by subjective
impressions of "seeing" or "speaking", although as Paivio points
out this is not necessarily the case.

Experiment 1 deals with possible verbal control of visual image
sequencing in an alphabetic list, plus a secondary question involving
the degree of generality of scanning rates for different imagined
letter forms. Experiment 2 serves as a control for response mode time,
which was not considered in the first experiment. Experiment 2' is
again addressed to the problem of verbal control of sequencing between
visual images, but this time word letter strings as well as alphabet



1 In the first author's classes over the past several years (N =
about 500) about 95% of the undergraduates claim the ability to visually
imagine distinctive letters of the alphabet. Fully 100% of the same
students claim the ability to name letters in implicit speech.
letter strings are examined. The comparison of word and alphabet
strings is made on the conjecture that entire word images, rather than
their individual letters, can be represented as a single chunk; i.e.,
several letters at a time can be generated by the word's name and then
held in spatially parallel representation in visual imagination.



Experiment 1 



Two recent studies have measured the rate at which visual and
verbal imagery occur. The first study (Weber & Bach, 1969) used entirely
subjective procedures. In the visual imagery condition, Ss were asked
to visualize rapidly the successive letters of the alphabet, with their
eyes closed, as they appeared one at a time flashed on an imaginary
movie screen. The rate obtained was slightly more than 2 letters/sec.
This rate was in marked contrast to verbal imagery rates with implicit
and explicit speech, both of which were about 6 letters/sec, in close
agreement with values from recent work using more objective procedures
(Weber & Blagowsky, 1971). While these studies were highly suggestive
in pointing to separate imagery systems for visual and verbal representations,
the variability associated with measuring visual imagery rates
underlined the need for more objective procedures. Hence, in another
study, Weber and Castleman (1970) developed an objective procedure for
dealing with visually imagined characters. In their Exp. 2, lowercase
printed letters of the alphabet were partitioned into two subsets:
vertically large letters (b, d, f, g, h, j...y) and vertically small
letters (a, c, e, i, m...z). The Ss were instructed to visually
imagine letters, in alphabetic order, and to classify each successive
letter for its vertical height. This task insured that S process each
letter for its spatial property of vertical size. The Ss in fact
reported using visual images of the letters in order to determine
whether they were large or small characters. When required to abstract
from the image and verbally report the visual property of each letter,
the rate dropped to about 1 letter/sec. Experiment 2' was a control
to determine whether Ss could have rotely learned, during the experiment,
the appropriate classification response for each individual letter.
The results were strongly contrary to such an interpretation. The
additional possibility that vertical height properties were mediated by,
say, a rote verbal process acquired prior to the experiment did not
accord with the subjective reports of visual images being generated
nor with other evidence concerning fatigue ratings.



However, verbal mediation may have been involved in a more interesting
sense. In Weber and Bach (1969) and Weber and Castleman (1970),
the sequencing between visual images may have been under verbal control.
In fact, several Ss reported at the close of the experiment that they



2 Some letters ("i" - NO, "t" - YES) are slightly ambiguous in
some type fonts, but once they are designated by E they present no
further difficulties (Weber and Castleman, 1970, p. ...).



had to speak each letter implicitly before visualizing it. This would
be consistent with the findings of Posner, Boies, Eichelman, and Taylor
(1969), which indicate that Ss can use letter names to generate visual
characteristics of letters. It is not clear how general such a
strategy is, nor whether it is optional or obligatory. For long serial
lists like the alphabet, perhaps implicit verbal generation is needed
to establish the sequential order of items, since the ordering capabilities
of memory for unrelated visual materials appear to be limited in comparison
to verbal materials (Paivio, 1971). Whatever the case, it seemed
necessary to examine directly the possibility of verbal generation and
control of visual image sequencing in ordered letter lists like the
alphabet.

Examination of visual image and implicit verbal relations is made
possible by having an objective indicator for imagery (Weber &
Castleman, 1970) and by the fact that implicit and explicit (aloud)
speech take place at about the same rate (Weber & Bach, 1969; Weber &
Blagowsky, 1970). An experimental question concerns whether an aloud
scan mode (saying each letter of a list before reporting its visual
property) will result in the same rate of reporting visual properties as
an unspecified scan (not aloud), in which S does not say successive
letters explicitly but just reports their visual properties. If the
rates are comparable for the two kinds of scan, then we would have
evidence independent of subjective impressions that implicit speech can
accompany the sequencing of successive visual images in the not-aloud
scan. This, together with the subjective impression of having to say
each letter before visualizing it, would be consistent with the notion
of verbal generation and control of visual image sequencing. However,
the implicit speech accompaniment might be purely ancillary, that is,
without functional significance. So still another way is required to
approach the question of verbal sequential control, and to this end we
examine also the role of response indicators in denoting the spatial
characteristics of visual images. Suppose that visual image sequencing
is mediated by saying successive letters prior to visualizing them (the
implicit speaking might be necessary to establish the sequential ordering
of the letters). Then we would expect a spoken response indicator
(YES or NO as to whether the letter has the property in question) to be
produced and output before the next letter could be implicitly named
and then visualized (saying both YES/NO and speaking the name of the
next letter could not take place simultaneously). But if the response
indicator were written (a line for YES and a dot "." for NO), then
it might be possible to begin saying implicitly the name of the next
letter and producing its image while executing the written response
classification for the preceding letter. This, of course, assumes that
writing and speaking are at least partially independent of one another
to the extent that Ss are able to say one thing while writing something
else.
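
The contrast between the two response modes can be illustrated with a toy timing sketch. Everything in it is an assumption made for illustration: the function names, the two-stage framing, and the example durations are hypothetical and are not values or models taken from the experiments. If a spoken response must finish before the next letter can be implicitly named, the stage times simply add. If a written response can overlap with naming and imaging the next letter, the slower of the two stages sets the pace.

```python
def serial_time(n_letters, name_s, image_s, respond_s):
    """Total time if each response must finish before the next letter is named.

    All durations are hypothetical, in seconds per letter.
    """
    return n_letters * (name_s + image_s + respond_s)


def overlapped_time(n_letters, name_s, image_s, respond_s):
    """Total time if the response to one letter overlaps preparing the next.

    Treated as a two-stage pipeline: "prepare" (name + image the letter)
    and "respond". The first letter must be prepared, the last response
    must be executed, and in between the slower stage sets the pace.
    """
    prepare = name_s + image_s
    return prepare + (n_letters - 1) * max(prepare, respond_s) + respond_s


# Hypothetical example: a written response that is slower (0.5 s) than a
# spoken one (0.3 s) can still give a shorter total time if it overlaps
# with preparation of the next letter.
spoken_total = serial_time(26, name_s=0.2, image_s=0.4, respond_s=0.3)
written_total = overlapped_time(26, name_s=0.2, image_s=0.4, respond_s=0.5)
```

Under these assumed durations the overlapped (written) total is shorter than the serial (spoken) total, which is the direction of difference the verbal control account would predict.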



Method 



Subjects.--The Ss were 48 undergraduate volunteers who received
extra course credit for their participation. There were 12 Ss randomly
assigned to each of four different between-Ss conditions. Additional
Ss were discarded for not following instructions and/or having extreme
difficulty in completing the task.

Procedure.--Upon entering the experimental situation, Ss were
checked for their familiarity with the alphabet and given remedial
instruction if necessary. All Ss were shown an index card with lowercase,
typewritten letters of the alphabet in random sequence, and then
instructed to differentiate between vertically large letters (YES
responses) and those that were not vertically large (NO responses).
This distinction was similar to that used in the Visual Property conditions
of Exp. 2 and 2' in Weber and Castleman (1970), although YES/NO
serve as the responses here rather than the LARGE/SMALL of the earlier
study. During the experiment proper, S was to imagine visually the
letters in alphabetic order and, as rapidly as possible, to classify
each imagined letter as YES/NO. At the beginning of each trial, S
started a remotely controlled Standard Electric clock when he was ready
to begin and stopped it as soon as he had classified the last letter
of the alphabet. The S did not receive feedback on his times, but
speed and a low error rate were emphasized. If an error did occur,
it was pointed out to S at the end of the trial. This constituted the
Lowercase (LC) condition. In addition, there was a comparable Uppercase
(UC) condition in which Ss distinguished between letters with long
vertical lines (B, D, E, F, H, I...) which required YES responses and
those without long vertical lines (A, C, G, J, O, Q...) which required
NO responses. Each S was to imagine both LC and UC alphabetic lists
alternately over 10 blocks of LC and UC trials. The LC and UC conditions
were balanced for serial position across Ss.

There were also four between-Ss conditions. The first two involved
scanning mode. An S scanned the alphabet by speaking it aloud (SA) or
by a nonspecified, not-aloud (NA) procedure as he generated in imagination
successive letters to examine and respond to their visual properties.
In the SA conditions, S would proceed as follows (for LC letters): "a"-
NO, "b"-YES, "c"-NO, ... "z"-NO; that is, speak each letter before
responding with YES or NO to describe the vertical height of the imagined
letter. In the NA conditions, no instruction was given regarding scanning,
and S did not speak aloud the successive letters before responding
overtly with YES or NO. That is, he would proceed as follows: NO, YES,
NO, ... NO. It was, of course, possible that S would generate the
successive letters in speech imagery before generating their visual
images, even though he received no instruction to do so, but this was
an empirical question to be answered by the similarity of processing
rates for the SA and NA conditions. Two other between-Ss conditions involving
spoken or written responses (SR, WR) were combined factorially
with the scanning mode conditions. In the SR conditions, S simply said
YES or NO for the visual property of each successive letter. In the
WR conditions, S wrote a long vertical line corresponding to a YES
response or wrote a dot corresponding to a NO response. The S was
to classify the alphabetic string as rapidly as possible and to keep
his eyes closed, in all conditions, so he would not visually monitor
his WRs.
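
The lowercase task and the two scanning modes can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the text lists the large letters as b, d, f, g, h, j ... y and treats "i" as NO and "t" as YES, so the full membership of `LARGE_LOWERCASE` below (letters with ascenders or descenders) is assumed, and the function names are hypothetical.

```python
import string

# Assumed set of vertically large lowercase letters (ascenders and
# descenders). Only b, d, f, g, h, j, t and y are named in the text;
# the remaining members are an assumption made for this sketch.
LARGE_LOWERCASE = set("bdfghjklpqty")


def classify(letter):
    """Return the YES/NO response for one imagined lowercase letter."""
    return "YES" if letter in LARGE_LOWERCASE else "NO"


def protocol(scan_mode):
    """Return the overt utterances for one alphabet trial with spoken responses.

    scan_mode "SA": each letter is spoken aloud before its YES/NO.
    scan_mode "NA": only the YES/NO responses are spoken.
    """
    utterances = []
    for letter in string.ascii_lowercase:
        if scan_mode == "SA":
            utterances.append(letter)
        utterances.append(classify(letter))
    return utterances
```

With this assumed letter set, `protocol("SA")` begins a, NO, b, YES, c, NO and `protocol("NA")` begins NO, YES, NO, in agreement with the sequences described above.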

In summary, the design consisted of a factorial arrangement of 2
Scanning Modes (SA, NA) x 2 Response Classes (SR, WR) x 2 Letter Cases
(LC, UC), with scanning mode and response class between Ss and letter
case within Ss. The principal response measure was time to process
the 26-member imagined alphabetic list.
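
The layout of this design can be enumerated as a short sketch. The variable names are illustrative only; the factor levels and the 12 Ss per between-Ss group come from the Method section.

```python
from itertools import product

SCAN_MODES = ("SA", "NA")        # between Ss
RESPONSE_CLASSES = ("SR", "WR")  # between Ss
LETTER_CASES = ("LC", "UC")      # within Ss
SS_PER_GROUP = 12

# Each S belongs to exactly one scan mode x response class group.
between_groups = list(product(SCAN_MODES, RESPONSE_CLASSES))

# Every group contributes data to both letter cases.
design_cells = [
    (scan, response, case)
    for (scan, response) in between_groups
    for case in LETTER_CASES
]

total_ss = SS_PER_GROUP * len(between_groups)
```

This yields four between-Ss groups, eight design cells, and 48 Ss in total, matching the number of volunteers reported under Subjects.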



Results and Discussion



Descriptive findings are presented in Fig. 1, in which mean time in
seconds is shown as a function of practice. The left ordinate shows
mean time per 26 letters and the right ordinate is the same information
on a time-per-letter basis. The left-hand panel is for lowercase
letters and the right-hand panel for uppercase letters (the bottom half
of the left panel contains results from previous experiments that will
be commented on later). The parenthetical number triple next to each
function represents the mean, standard error of the mean, and the scan
rate in letters per second processed. Each mean is based on 120 observations
(12 Ss x 10 Blocks); the standard error of the mean is a between-Ss
measure of variability based on each S's mean; and the scan rate is
simply the reciprocal of the mean. Errors (omission and commission)
occurred for fewer than 2% of the letters (a phase error was counted
only once). Those trials on which errors did occur were not temporally
disparate from correct trials, so no further distinction is made.
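
The relation between the two ordinates and the scan rate can be written out as a small sketch. The function names and the example times are hypothetical; only the 26-letter list length and the definitions (time per letter, reciprocal scan rate, between-Ss standard error based on each S's mean) come from the text.

```python
import math
import statistics

ALPHABET_LENGTH = 26


def time_per_letter(mean_list_time_s, n_letters=ALPHABET_LENGTH):
    """Convert mean time per list (left ordinate) to time per letter (right ordinate)."""
    return mean_list_time_s / n_letters


def scan_rate(mean_list_time_s, n_letters=ALPHABET_LENGTH):
    """Letters processed per second: the reciprocal of the time per letter."""
    return n_letters / mean_list_time_s


def between_ss_sem(subject_means):
    """Standard error of the mean computed across the individual Ss' means."""
    return statistics.stdev(subject_means) / math.sqrt(len(subject_means))
```

For example, a hypothetical mean list time of 26 sec corresponds to 1 sec per letter and a scan rate of 1 letter/sec, while 13 sec per list corresponds to 2 letters/sec.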
(Fig. 1. Left panel for lowercase letters and right panel for uppercase letters.)
Mean processing times are almost identical for spoken scanning
(SA) and unspecified scanning (NA), with F (1, 44) < 1. The absence of
a significant scan effect is consistent with the view that implicit
speech scanning is involved in the NA conditions. However, the differences
in variability for the two scans do suggest at least some
difficulty with assuming that completely identical processes are involved.

There is a substantial difference between WR and SR modes, F (1,
44) = 14.55, p < .01. Employing a WR mode takes much less time than
an SR mode, even though writing is probably a slower response system
than speaking. Again this is consistent with the view that sequencing
between visual images is accompanied by implicit verbal processes. SRs
appear to delay the hypothesized verbal control processes involved in
generation and sequencing between successive visual images of letters.

Finally, the results for upper- and lower-case letters suggest that
visual imagery rates are about the same for two quite different classes of
visual forms, F (1, 44) = 2.56, p > .05. This would tend to increase
the generality of prior findings (Weber & Bach, 1969; Weber & Castleman,
1970) by suggesting that visual imagery rates are constant over fairly broad
fluctuations in visual form. Just how large a variation in visual form
would be possible before differences in rate occurred is open to question.
None of the main effects showed significant interactions with one another.

Large practice effects are also evident in Fig. 1. This is in
contrast to some other imagery tasks, as shown in the bottom half of
the left panel. The visual imagery (VI) function is from Weber and
Castleman (1970, Exp. 2) and did not require S to abstract the height
properties of the letters or to respond YES/NO. The procedure was
entirely subjective; S simply stopped the clock when he had completed
visualizing, one at a time, the letters of the alphabetic string. The
large practice effects with the visual property task of the present
study would seem, then, to be a result not of visual image representation
but of abstraction and/or overt response time. Both response conditions
of the present study seem to be stabilizing somewhat above the VI condition,
as would be expected if abstracting height properties and
responding YES/NO takes additional time over visualizing letters. The
bottom function is for verbal or speech imagery (SI) and is also from
Weber and Castleman (1970, Exp. 1). It simply required S to say to
himself, as rapidly as possible, the letters of the alphabet and stop
the clock when finished. It provides an estimate of the minimum component
time that would be required of a verbal control letter-naming
process in a VI condition.

In reviewing the evidence for verbal control in the sequencing of
images, perhaps most compelling is the multiple approach to the verbal
control issue. The absence of a significant scanning effect (SA, NA)
is consistent with an implicit speech letter-by-letter naming process
in the NA conditions. The presence of a significant response mode
effect (SR, WR) is consistent with an implicit speech process, generating
and controlling the sequencing of letters, which is delayed by the
overt verbal response of categorizing letters as YES/NO. And once
again, there is the strong subjective impression of having to say each
letter before visualizing it.



A cautionary note is still in order. The response mode effect
would come as no surprise if writing were in fact a more rapid response
mode than speaking. This unlikely possibility is considered in Exp. 2.
Another related possible explanation of the response mode effect is
that the written line and dot responses are initially more compatible
with the height properties of visual images than are the spoken YES/NO
responses. But if this were the case, we would expect WR and SR mode
functions to converge rapidly across blocks of practice, something that
does not occur.



Each of the findings and arguments for verbal control may not be
completely compelling in its own right, but taken together they provide
a strong line of converging evidence pointing to the simple notion that
S says each letter before visualizing it. The generality of such a
notion is immediately suspect, and Exp. 2' in this series is directly
addressed to the question of when verbal accompaniment of visual image
sequencing does and does not take place.





Experiment 2

It seems obvious that in most instances "equivalent" materials
can be spoken at least as rapidly as they can be written. The findings
and interpretations of Exp. 1 depend on such an assumption. However,
we could not seem to find evidence on this point in the recent
psychological literature. Also it is not completely clear how equivalent
spoken YES/NOs are to written lines and dots; perhaps the written
responses are more rapid. A direct comparison of writing and speaking
rates is called for, in order to strengthen the rationale of Exp. 1.
Such a comparison also has intrinsic interest since writing and speaking
are, perhaps, our two most important means of signaling information to
others about our internal states. Moreover, writing and speaking are
widely used as response modes in psychological research, without much
regard to their potentially very different output rates and the
implications this has for, say, short-term-memory decay work.

In this experiment, then, response times for speaking and writing
are assessed. For future use, two different symbolic materials are
employed: an alphabetic list and a binary list consisting of spoken or
written equivalents of YES/NO. The written analog of YES/NO is, as in
Exp. 1, a vertical line and dot, respectively.



Method



Subjects.--The Ss were 10 volunteers from an undergraduate psychology
class.



Design and procedure.--A factorial design was used, with 2 Types of
String (Alphabet, Binary) x 2 Response Modes (Written, Spoken). Both
factors were within Ss. In the written condition of the alphabet task,
Ss were asked to write the alphabet in their normal connected cursive
handwriting as rapidly as possible. (Response time was always the
dependent variable.) In the spoken alphabet condition, Ss were asked
to speak the alphabet aloud as rapidly as possible. In the spoken binary
condition, Ss spoke aloud a 26-member series of alternating YES/NOs
(YES, NO, YES, NO, ...). In the written binary condition, Ss wrote out
a 26-member series of alternating lines and dots ("/./." etc.). The
lines and dots correspond to the YES/NO written classification of Exp. 1.
In the binary tasks, we did not wish to require Ss to count their
responses to have only 26, so a sheet of paper was divided into seven
columns 1 in. wide. For the written binary task, S wrote his responses
in clusters of 4/column (/./.; /./.; etc.), except for the last column,
which required only 2 responses to equal the 26. For the spoken binary
task, S kept track by moving his finger to a new column on completion
of each cluster of four spoken responses.

Procedures for timing were the same as for Exp. 1. Each S was given
six trials in each of the four conditions. A trial consisted of a
single processing, i.e., writing or speaking an alphabetic or binary
list. Following the six trials of the first condition, each S was
given the six trials of the next condition, etc. The orders of the four
conditions were randomized for each S.
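The column bookkeeping described for the binary tasks can be sketched as
follows. This is only an illustrative sketch: the function names are
hypothetical, and "/" and "." stand in for the written vertical line and
dot.

```python
# Illustrative sketch of the 26-member binary string and its layout in
# columns of four responses (the last column holds only two).
# Function names are hypothetical; "/" = vertical line (YES), "." = dot (NO).

def binary_sequence(n=26):
    """Alternating YES/NO series starting with YES."""
    return ["YES" if i % 2 == 0 else "NO" for i in range(n)]

def into_columns(seq, per_column=4):
    """Split the series into clusters of `per_column` responses."""
    return [seq[i:i + per_column] for i in range(0, len(seq), per_column)]

def written_equivalent(seq):
    """Map YES/NO onto the written line/dot code."""
    return "".join("/" if r == "YES" else "." for r in seq)

if __name__ == "__main__":
    for number, column in enumerate(into_columns(binary_sequence()), start=1):
        print(number, written_equivalent(column))
```

Six full columns of four plus a final column of two give the required 26
responses without S having to count.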



Results and Discussion 



Mean response time in seconds for each task is shown in Table 1.
The response time for the WR mode is greater than for the SR mode on
both the binary and alphabet strings. An analysis of variance shows
that the type of response has a significant effect, F (1, 9) = 172.22,
p < .01. The type of string does not have a significant effect, F
(1, 9) = 1.84, p > .05. However, there is an obvious significant
interaction between type of response and type of string, F (1, 9) = 99.92,
p < .01. The main comparison of a priori concern is that of the binary
spoken and written conditions. Spoken responding is reliably faster
than written responding, with correlated t (9) = 3.87, p < .01.

The greater response time for the WR mode as compared to the SR 
mode for both the binary and alphabet strings reflects in part response 
times for different motor acts. The significant String x Response 
interaction can be interpreted as meaning that with written responses, 
Ss can execute the simple vertical lines and dots required by the binary
task faster than the more complex curved lines required by the alphabet
task. But when spoken responses were used, Ss could say the alphabet
faster than the string of 26 alternating YES/NOs.

These results suggest that the differences between WR and SR modes 
of Exp. 1 may have been conservative, since in sheer response time 
(unaccompanied by a visual image task and with a completely regular 
alternation of YES/NO responses), written responding is slower than
spoken responding. It is, of course, possible to argue that sequences
other than an alternating YES/NO would produce different results, that
different sequential patterns would interact differently with the two
response modes, but such effects, if they occur, are likely to be
small indeed compared to the basic distinction between writing and
speaking. 



TABLE 1 

MEAN AND STANDARD ERROR OF THE MEAN FOR PROCESSING 26 ITEM 
BINARY OR ALPHABETIC STRING, EXPERIMENT 2 







                     Binary               Alphabet
Statistic       Written   Spoken      Written   Spoken

X (mean)           9.37     7.50        12.87     4.71
SE                 1.55     1.80         2.07     1.30

Note.--Means averaged over 10 Ss x 6 trials for each condition. SE
is measured in seconds.
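The size of the String x Response interaction can be read directly off
the Table 1 means. The short sketch below restates those four means and
computes the written-minus-spoken difference for each string type, the
difference between those differences, and the output rate in items per
second. The function names are hypothetical, and the sketch uses only the
cell means, so it illustrates the pattern and does not reproduce the
analysis of variance.

```python
# Cell means from Table 1: seconds to process a 26-item string.
MEANS = {
    ("binary", "written"): 9.37,
    ("binary", "spoken"): 7.50,
    ("alphabet", "written"): 12.87,
    ("alphabet", "spoken"): 4.71,
}
ITEMS = 26  # items in every string

def written_minus_spoken(string_type):
    """Extra time (sec) needed for written over spoken responding."""
    return MEANS[(string_type, "written")] - MEANS[(string_type, "spoken")]

def interaction_contrast():
    """Difference between the two written-minus-spoken differences."""
    return written_minus_spoken("alphabet") - written_minus_spoken("binary")

def items_per_second(string_type, mode):
    """Output rate implied by a cell mean."""
    return ITEMS / MEANS[(string_type, mode)]

if __name__ == "__main__":
    for string_type in ("binary", "alphabet"):
        print(string_type, round(written_minus_spoken(string_type), 2))
    print("interaction contrast:", round(interaction_contrast(), 2))
```

Written responding costs under 2 sec more than spoken responding on the
binary string but over 8 sec more on the alphabet string, which is the
pattern the interaction reflects.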



Experiment 2'



Experiment 2 indicated that sequentially generating the visual image 
representations of an ordered list like the alphabet is under verbal 
regulation. But it is clearly the case that sequential ordering for at
least some visual images is not verbally controlled in the sense of
first having to say a verbal counterpart of each imagined item in the
list. Likely examples would include dreams, hallucinatory experiences,
and visual reverie. Such images seem to have a sequential life of their
own, and their content and order often defy verbal description. Another
case in which individual images do not seem to be under sequential
verbal control would seem to be spatially parallel composite images,
that is, images with several different individual components appearing
simultaneously but separated in subjective space. An example would be
a short word, the letters of which might be imagined at the same time.
Thus the letters of the word "cat" might be represented in visual
imagination sequentially "c," "a," "t," or simultaneously "cat."
Extracting visual spatial properties from a letter string that is
sequentially generated (alphabet) may well be different from abstracting
visual spatial properties from a letter string in which all letters might
be represented simultaneously in imagination, as in a short word.

In this experiment, the relative rate of abstracting visual 
properties from different types of letter strings, i.e., from the
alphabet and from words, is measured under different scanning and
response conditions. By using two different scanning modes, spoken and
nonspoken, and two different response modes, written and spoken, the
presence or absence of individual verbal letter generation in visual
imagery for word versus alphabet strings can be investigated. If
visual image formation of the successive individual letters of a word
is also governed by saying implicitly each successive letter, then
relative scanning times should be the same for spoken and nonspoken
scanning, and written responding should be faster than spoken
responding. The rationale is the same as Exp. 1. However, suppose the
word name (together with the appropriate instructional set) is
sufficient to arouse a spatial simultaneously available visual image of
the entire word (or at least several letters of it). Then speaking each
individual letter of the word would not be necessary prior to
visualizing its letters, and an NA scan should take less time than a
letter-by-letter SA scan. Also, the difference between the SR and WR
modes should diminish for the NA scan in comparison to the SA scan,
since in the former condition full advantage could be taken of the
simultaneously available image without the additional potentially
disruptive process of saying each successive letter one at a time.



Method 



Subjects.--The Ss were 40 undergraduate college students who
participated in order to fulfill an introductory psychology course
requirement. Two potential Ss were not used because they could not
recite the alphabet correctly. One other potential S was not used
because he could not learn to represent imagined large and small
lowercase typed letters of the alphabet with YES and NO, respectively.
The Ss were assigned at random to conditions, with 10 Ss per between-Ss
condition.


Design and procedure.--The experimental tasks used in this experiment
involved the use of the alphabet and 18 four-letter words; all words
were of one syllable and were high-frequency-usage words (defined as A
or AA by Thorndike and Lorge). Each of the words contained both
vertically large and small letters, e.g., "girl." From images of the two
different letter strings (words and alphabet), Ss were to abstract
visual properties of the successive letters. Thus the image "girl"
would give rise to the response sequence YES, NO, NO, YES. The design
and procedure were similar to that of Exp. 1 and consisted of a
factorial arrangement of 2 Response Modes (Written, Spoken) x 2 Scans
(SA, NA) x 2 String Types (Alphabet, Word). The first two factors were
between Ss and the last factor was within Ss.

Each S was given several preliminary practice trials for
abstracting visual properties from the letters of the alphabet and for
abstracting visual properties from the letters of a practice word. A
spoken ready signal was given, and E started the remotely controlled
clock with the onset of the spoken cue ("alphabet" or "X", where X was
one of the stimulus words). The S was to respond as rapidly as possible
by classifying the successive letters as YES/NO. The E also stopped the
clock on completion of the visual property "spelling" of the word or
alphabet, i.e., after the response to the fourth or twenty-sixth letter.
This was a departure from the previous two experiments in which Ss
controlled the clock. The departure was necessary because pilot data
showed that E was more accurate than Ss in measuring the short response
times involved with words in this experiment. The response times were
measured from the onset of the alphabet or word cue to the response to
the last letter in a string. To provide greater accuracy in measuring
response times, E also tape-recorded all responses and later played
back the tapes at half-speed. The S's spoken responses were recorded
as usual. To tape-record S's written responses, a soft lead pencil
was used to write on an aluminum sheet to which a microphone was taped.
The distinctive sounds made by S's pencil on the aluminum sheet were
readily picked up and served to demarcate response onsets. Again, S did
not visually monitor his responses.

Each S was given 10 trial blocks. Each block was composed of 6
stimulus words of 4 letters each and one alphabet string. This allowed
for roughly the same amount of work per block for both types of letter
string (actually 24 letters for word strings and 26 letters for the
alphabet string in a given block). The 6 words for each S were randomly
selected from the pool of 18 4-letter words. The same 6 words appeared
in each of the 10 trial blocks given to a given S, but different Ss
received different sets of 6 words to increase generality. The order
of appearance for words and the alphabet was randomized from block to
block.
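The letter-height classification that S performs on a word such as
"girl" can be sketched as below. The set of vertically large letters is
an assumption inferred from the examples in the text (ascenders and
descenders, with "i" treated as small); the function names are
hypothetical, and "/" and "." stand in for the written line and dot.

```python
# Assumed set of vertically large lowercase letters (ascenders and
# descenders). Inferred from the examples in the text, not quoted from it.
TALL = set("bdfghjklpqty")

def classify(word):
    """YES for each vertically large letter, NO for each small one."""
    return ["YES" if ch in TALL else "NO" for ch in word.lower()]

def written_equivalent(responses):
    """Written response code: vertical line for YES, dot for NO."""
    return "".join("/" if r == "YES" else "." for r in responses)

def is_usable(word):
    """Four letters, with both large and small letters present."""
    responses = classify(word)
    return len(word) == 4 and "YES" in responses and "NO" in responses

if __name__ == "__main__":
    print(classify("girl"), written_equivalent(classify("girl")))
```

A word made up only of small letters (or only of large ones) would call
for a constant response, which is presumably why every stimulus word mixed
the two kinds of letter.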



Results 



Each S's mean response time across trials was calculated. In order
to compare directly S's responses in processing the word and the alphabet
letter strings, his response times in each condition were computed on a
time-per-letter basis. Therefore, each S's mean processing time per
letter was the unit used in the subsequent analyses.

As can be seen from Table 2, the processing times are slightly
faster in the NA scan conditions than in the SA scan conditions, F (1,
36) = 5.53, p < .05. Processing times are also faster in the word
condition than in the alphabet condition, F (1, 36) = 97.99, and faster
in the WR condition than in the SR condition, F (1, 36) = 46.01, both
ps < .01.

TABLE 2

MEAN, STANDARD ERROR OF THE MEAN (SE_M), AND RATE AS A FUNCTION
OF STRING, SCAN, AND RESPONSE MODE, EXPERIMENT 2'

                              Response Mode
String and            Spoken                   Written
Scan             M     SE_M   Rate        M     SE_M   Rate

Alphabet
   SA          1.01    .14     .99       .61    .16    1.64
   NA           .96    .27    1.04       .60    .15    1.67
Words
   SA           .86    .10    1.16       .49    .12    2.04
   NA           .53    .09    1.89       .47    .12    2.13

Note.--Means are determined by averaging over 10 Ss and 10 trials per
condition. SE_M is measured in sec/letter and rate is measured
in letters/sec.
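The Rate columns of Table 2 are the reciprocals of the corresponding
means, and the spoken-minus-written difference in each row shows where
the response mode effect shrinks. The sketch below restates the eight
cell means from the table and computes both quantities; the function
names are hypothetical.

```python
# Cell means from Table 2 (sec/letter), keyed by (string, scan, response).
MEANS = {
    ("alphabet", "SA", "spoken"): 1.01, ("alphabet", "SA", "written"): 0.61,
    ("alphabet", "NA", "spoken"): 0.96, ("alphabet", "NA", "written"): 0.60,
    ("words", "SA", "spoken"): 0.86, ("words", "SA", "written"): 0.49,
    ("words", "NA", "spoken"): 0.53, ("words", "NA", "written"): 0.47,
}

def rate(string, scan, response):
    """Letters per second: the reciprocal of the mean time per letter."""
    return round(1.0 / MEANS[(string, scan, response)], 2)

def spoken_minus_written(string, scan):
    """Response mode effect (sec/letter) within one string/scan row."""
    diff = MEANS[(string, scan, "spoken")] - MEANS[(string, scan, "written")]
    return round(diff, 2)

if __name__ == "__main__":
    for string in ("alphabet", "words"):
        for scan in ("SA", "NA"):
            print(string, scan, spoken_minus_written(string, scan))
```

The spoken-minus-written difference is between .36 and .40 sec/letter in
three of the four rows but only .06 sec/letter for words with an NA scan.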



Of particular interest are the interaction effects in this
experiment. There is a significant Scan x String interaction, F (1, 36) =
11.20, p < .01, indicating the differences between spoken and nonspoken
scans were greater in the word condition than in the alphabet condition.
There is also a significant Response x String interaction, F (1, 36) =
15.27, p < .01, showing that the differences between SRs and WRs were
greater in the alphabet condition than in the word condition. Finally,
there is a significant three-factor Response x Scan x String interaction,
F (1, 36) = 11.20, p < .01, indicating the relatively unique character
of word strings with SRs and NA scanning.

Practice effects across trial blocks warrant comment. They were
relatively flat for word strings, with no indication of a Condition x
Practice interaction. The alphabet strings had practice effects
similar to the corresponding conditions of Fig. 1, Exp. I, again with
no indication of a Condition x Practice interaction.

Discussion



The equivalence of SA and NA scanning for alphabetic strings
agrees with the findings of Exp. 1 and of Weber and Blagowsky (1970).
The present results are also consistent with the notion that the visual
imaging of individual letters in an alphabet string is under verbal
control in the sense that each letter is implicitly spoken prior to
imagining it. In contrast, the results for word strings are not
equivalent for spoken and nonspoken scanning and are therefore
inconsistent with the notion that S implicitly speaks each letter of a
word when using a nonspoken scan mode. Verbal control over the
sequential generation of visual images of the letters is not needed with
the word letter strings used in this study. Instead, on the basis of a
word's spoken name, S may visually generate the word as a whole, or at
least with more than one letter at a time, simultaneously represented in
visual imagination. The S could then scan left to right for the visual
properties of the simultaneously available letters without having to
fall back on implicit verbal processes. This is indeed the subjective
impression of what is done. The time advantage with nonspoken scanning
would arise because the names of the individual letters constituting the
word would not need to be verbalized, since the visual image
representation of the word would be simultaneously available prior to
scanning individual letters. Hence, with word strings the verbal YES/NO
responses should be comparable in rate to the written YES/NO
equivalents, as is the case. Along these same lines, the sequential
generation problem disappears, even for alphabet strings, if the string
is perceptually available. Weber and Kelley (1972) have found that the
spoken/written difference for the visual property task disappears when
the alphabet string is visually present in front of S. They hypothesize
that the sequential ordering problem is handled in that case by
left-to-right eye movements over the string rather than through the use
of implicit verbal processes as suggested here.

While the processes suggested for word and alphabet strings in
the present series of experiments may seem dichotomous, it is likely that
scanning a long polysyllabic word for the visual properties of its
letters would be part way between the limiting cases of short familiar
words versus long serial alphabetic lists. We would expect spatially
parallel visual image representation to be possible within familiar
letter clusters with a one- or perhaps two-syllable name, but sequencing
between syllables for longer words might well be under verbal control,
as with the alphabetic strings. Thus, the visual property scanning of
the imagined lowercase word "oklahoma" might allow for the spatially
parallel visualization of letters within a syllable, while the
sequencing between syllables would involve verbal processes. This
would certainly seem to be the case with a word like
"supercalifragilisticexpialidocious."

The picture that emerges is of a visual imagery system with a 
capacity for limited, spatially parallel representation (Paivio, 1971). 
When a familiar letter string with a short name falls within that
capacity, visual image sequencing is not verbally generated. But when
a letter string exceeds that capacity, sequential generation and
ordering become necessary and visual imagery comes under the control of
verbal imagery in the form of implicit speech.



References

Bower, G. Analysis of a mnemonic device. American Scientist, 1970,
58, 496-510.

Paivio, A. Imagery and verbal processes. New York: Holt, Rinehart
& Winston, 1971.

Posner, M. I., Boies, S. J., Eichelman, H., & Taylor, R. Retention
of visual and name codes of single letters. Journal of
Experimental Psychology Monograph, 1969, 79 (1, Pt. 2).

Weber, R. J., & Bach, M. Visual and speech imagery. British Journal of
Psychology, 1969, 60, 199-202.

Weber, R. J., & Blagowsky, J. Metered memory search with implicit and
explicit scanning. Journal of Experimental Psychology, 1970,
84, 343-348.

Weber, R. J., & Blagowsky, J. Metered memory search and concurrent
chanting. Journal of Experimental Psychology, 1971, 89, 162-170.

Weber, R. J., & Castleman, J. The time it takes to imagine. Perception
& Psychophysics, 1970, 8, 165-168.

Weber, R. J., & Kelley, J. Aspects of visual and acoustic imagery.
Psychonomic Science, 1972, 27, 121-122.



EXPERIMENT 3: IMAGE AND PERCEPT REPRESENTATION OF LETTERS

The purposes of this study are to compare processing rates for
visual and acoustic imagination, to compare rates of abstracting visual
and acoustic properties from imagined vs perceptual representations of
letters, and to show by manipulation of response mode and representation
the possibility of verbal control of visual image sequencing. Rates of
processing letters in visual imagination have been previously
investigated (Weber & Bach, 1969; Weber & Castleman, 1970), and the
sequencing problem has been dealt with at some length by Paivio (1971).
The sequencing problem pertains to how, in a serial list of visual
images, the S gets from one visually imagined form to another. In the
Weber et al. studies, Ss claimed, subsequent to the experiment, that it
was necessary to say implicitly each letter before visualizing it. It is
hypothesized here that the series of visually imagined letters of the
alphabet is verbally ordered in the sense of S's saying implicitly each
successive letter before visualizing it. If so, then we have a
potentially important interrelationship between our two most important
symbolic codes, visual imagery and verbal processes (Paivio, 1971).
Investigation of these foregoing problems seems to require an objective
method of assessing the existence of visual imagery.

Following Weber & Castleman (1970, Experiment 2), it is possible to
employ objective criteria of imagining letters and to assess the rate at
which letters are processed in imagination. These investigators had Ss
classify imagined lowercase letters on the basis of height. Some letters
are vertically small (a, c, e, i, ..., z) and other letters are
vertically large (b, d, f, g, ..., y). The Ss were instructed to imagine
visually the successive alphabetic letters and to classify each
successive letter for its vertical height. The Ss reported visually
imagining and were able to correctly classify the letters, using a
verbal response ("small," "large") at a rate of about 1 letter per
second. The rate differences for the objective and subjective procedures
require comment. If Ss did, in fact, say implicitly each successive
letter before visualizing it, then we would expect that a verbal
classification response would compete for processing capacity with the
covert verbal sequencing. If this is the case, then processing rate
ought to be faster with a nonverbal classification response. Thus, if Ss
are to give either a verbal YES/NO or a written line/dot equivalent
(/, .) in responding to each imagined letter, we would expect slower
rates for the verbal response mode than for a written response mode if
verbal control of the sequence is required. This would be due to
competition for verbal processing space on the part of both the overt
verbal YES/NO response and the implicit verbal control of sequencing.

The general technique of abstracting from images the correlates of
physical stimuli also can be applied to the study of acoustic
imagination. Letter names possess acoustic properties that can be used
as objective indicators of acoustic imagination. Thus, some letter names
have a long e sound, /i/ (b, c, d, e, ..., z), and other letter names
do not (a, f, h, ..., y). If S can correctly classify successive
alphabetic letters according to their acoustic properties without
actually saying them aloud, we can be reasonably sure that acoustic
imagination is at work.1 When doing this task, there is again the strong
impression of saying each letter prior to extracting its acoustic
property. This may be necessary only for producing and abstracting the
acoustic property of the imagined letter, but it may also have a
sequential control function, as previously conjectured for visual
imagery.

In some preliminary work, we have found the actual rate of letter
classification to be very similar for visual and acoustic properties.
But, subjectively, visual and acoustic imagination are very different.
Accordingly, we have sought to find at least one objective variable that
clearly separates them. The variable selected was imagined vs perceptual
representation of the alphabetic sequence. It was reasoned that
letters visually present (percepts) would have their visual properties
immediately available for abstraction, whereas their acoustic properties
would be no more available than in imagination. In addition, when the
letters are visually present, as percepts, the sequencing problem could
be handled by eye movements rather than by implicit verbalization.
Hence the differences in rate between written and verbal response modes
should diminish.



Method 



Subjects.--Twenty undergraduate volunteers were paid for their
participation. One additional S was discarded for inability to do the
task. Ten Ss were assigned at random to each of two between-Ss
conditions.

Design and Procedure.--The design consisted of a factorial
arrangement of two methods of representation (image, percept) by two
properties (visual, acoustic) by two response modes (spoken, written).
Response mode was between Ss, and the other two factors were within Ss.
Representation could either be imaginal, meaning that the Ss somehow
implicitly generated the successive alphabetic letters, or perceptual,
meaning that a printed lowercase serial list of the alphabet was visually
present before the S while he made his responses. The properties of the
letters were either visual (in which case S had to distinguish between
vertically large and vertically small letters of the lowercase alphabet)
or acoustic (in which case S had to distinguish between letters with the
long e sound, /i/, in their names from those that did not). Vertically
large letters and long e letters were to be classified as YESs, and the
other letters were to be classified as NOs. For the spoken response
mode, S said YES or NO for each successive letter of the alphabet, as he
classified it for its properties. Thus, in the visual property image
condition, S would emit a string of the following sort: NO, YES, NO,
YES, ..., NO, because the successive letters of the alphabet (a, b, c,
d, ..., z) have that pattern of YES/NO classifications of vertical
height. If the letters were to be processed for their acoustic
properties, the corresponding string of YES/NOs would be: NO, YES, YES,
YES, ..., YES. When all letters are so considered, there are, for visual
properties, 12 YES and 14 NO letters. For acoustic properties, there
are 9 YES and 17 NO letters. Hence there is somewhat less response
uncertainty for acoustic than for visual properties. For the written
response conditions, the same classification was employed, except that
vertical line and dot were chosen to correspond to YES and NO,
respectively. The S wrote his response on a blank 8 1/2 x 11 in. sheet,
but he did not visually monitor his writing.
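The YES/NO counts and the claim about response uncertainty can be checked
with a short sketch. The two letter sets below are assumptions chosen to
match the examples and counts given in the text (with "i" treated as
vertically small and "z" named "zee"). The binary entropy used as an
index of uncertainty is likewise an illustrative choice, not a measure
taken from the study, and the names are hypothetical.

```python
import math
import string

# Assumed classification sets, chosen to match the counts in the text.
TALL = set("bdfghjklpqty")   # vertically large lowercase letters (YES)
LONG_E = set("bcdegptvz")    # letter names containing the long e sound (YES)

def classify(yes_set):
    """YES/NO string for the 26 letters of the alphabet, in order."""
    return ["YES" if ch in yes_set else "NO" for ch in string.ascii_lowercase]

def counts(yes_set):
    """Number of YES and NO letters for one property."""
    yes = classify(yes_set).count("YES")
    return yes, 26 - yes

def uncertainty_bits(yes_set):
    """Binary entropy (bits per letter) of the YES/NO proportions."""
    yes, no = counts(yes_set)
    return -sum((n / 26) * math.log2(n / 26) for n in (yes, no))

if __name__ == "__main__":
    print("visual  ", counts(TALL), round(uncertainty_bits(TALL), 3))
    print("acoustic", counts(LONG_E), round(uncertainty_bits(LONG_E), 3))
```

With 12 YES and 14 NO letters the visual task is close to the maximum of
one bit per letter, whereas the 9/17 split of the acoustic task gives a
somewhat lower value.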




The nature of the acoustic and visual properties of letters was
pointed out to S, as well as the particular response mode he was to use.
He was then given four practice trials, one for each condition. He was
told not to make any errors. Later, during the experiment proper, if an
error was made, it was pointed out to S at the end of the trial. This
procedure leads to low error rates (Weber & Castleman, 1970), and errors
are not further considered in this study.

The order of presentation of the image-percept by visual-acoustic
property conditions was random within each of six blocks. There were
four different random lists of the six blocks. The beginning of a trial
began with the E's designation of the trial as percept/image and the
presentation of a card indicating whether visual or acoustic properties
were to be processed. Response time for processing the 26 letters of
the alphabet was the dependent variable. The response time interval
began with presentation of the card and ended with S saying, "Stop,"
after completing the last letter of the alphabet. Times were recorded on
a Standard Electric clock to the nearest .01 sec. The S was not given
temporal feedback, but he was urged to go as rapidly as possible. There
was about a 30-sec interval between successive trials while the E
recorded the response time and reset the clock.

Results and Discussion



Table 1 shows mean times to process the 26 letters of the alphabet
as a function of conditions. Each mean is based on 10 Ss and six trials.
The SE is a between-Ss measure of variation. Significance tests for
main effects indicate the following: Response time is greater for image
than for percept representation, F(1,18) = 121.84, p < .01. Response
time is greater in the spoken than in the written mode, F(1,18) = 5.53,
p < .05. And response time is greater in the acoustic than in the
visual property conditions, F(1,18) = 15.85, p < .01.

The findings for main effects must be qualified, however. Perhaps
of most interest are the significant two-way interactions: Response
Mode by Image-Percept Representation, F(1,18) = 56.57, p < .01; and
Visual-Acoustic Property by Representation, F(1,18) = 30.77, p < .01.
The significant Response Mode by Representation interaction indicates
that the differences between spoken and written responses were much
greater in the image than in the percept conditions. This is to be
expected if verbal sequential control is involved in the image
conditions. In that case, the verbal control process would compete for
processing space with the spoken YES/NO response. The significant
Property by Representation interaction indicates that the difference
between visual and acoustic property processing time is greater for
percept than for image representation. Again, this is to be expected,
since visual properties are directly available in the percept
representation, whereas acoustic properties are not directly available.
The Property by Representation interaction illustrates clearly that
visual and acoustic imagination are separate processes, in accord with
our subjective impressions. Neither the Response Mode by Property nor
the triple interaction approached significance at the .05 level.

With regard to sequential control, the objective results are
consistent with the subjective impression that S implicitly says each
successive alphabetic letter before visualizing it and abstracting its
spatial height property or before "hearing" it and abstracting its
acoustic sound property. For the image conditions, when a spoken YES/NO
response is required, it competes for verbal processing capacity with
the implicit speech generation of the alphabetic sequence. The written
response actually leads to more rapid processing because it does not
compete for processing capacity with the implicit verbal generation of
the sequence. In a long serial list there is, then, evidence for an
important interrelation of visual and implicit verbal codes. But this
relation does not hold for the percept conditions, where sequencing
between successive letters is no longer under verbal control. The
sequencing problem is solved simply by moving the eyes from one
successive letter form to another. Thus, the difference between spoken
and written response time disappears in the percept conditions. Of
course, this argument applies only to a long ordered string like the
alphabet. Undoubtedly, visual imagination for some kinds of materials is
lacking in a verbal sequencing component.



Several final comments are in order. First, when the means of
Table 1 are converted to a rate measure, the most rapid image condition
at 1.75 letters per second is for visual properties with a written
response. This is very comparable to previous subjective measures of
visual imagery (Weber & Bach, 1969; Weber & Castleman, 1970, Experiment
1), which neither required overt letter-by-letter responding nor had the
additional within-S acoustic property task. On the basis of this
comparison, we can be reasonably sure that the written response does not
appreciably interfere with and slow down visual image representation.
Second, the most rapid imagined acoustic property condition also
occurred with the written response mode and was 1.67 letters per second.
This is in marked contrast to the approximately 6/sec rate of saying the
letters implicitly (Weber & Castleman, 1970). Thus, there is something
quite different in "pure" visualization (without letter-by-letter overt
responding) as compared to "pure" verbalization (saying the successive
letters silently to one's self). But this difference tends to
disappear when each individual letter must be overtly responded to,
suggesting a complex set of relationships. Third, the unequal distribution
of YES/NO letters for the visual and acoustic property tasks probably
gives rise to a conservative difference. Thus, if YES/NO uncertainty
were equal for the two tasks, the acoustic property task might have been
even slower.
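
As a small worked illustration of the rate comparison above, the
following sketch converts the three rates quoted in the text into
seconds per letter. Only the rates themselves come from the text; the
function and label names are illustrative assumptions.

```python
def seconds_per_letter(letters_per_second: float) -> float:
    """Convert a processing rate (letters/s) into time per letter (s)."""
    if letters_per_second <= 0:
        raise ValueError("rate must be positive")
    return 1.0 / letters_per_second


# Rates quoted in the text, in letters per second.
RATES = {
    "image, visual property, written response": 1.75,
    "image, acoustic property, written response": 1.67,
    "implicit speech (Weber & Castleman, 1970)": 6.0,
}

for label, rate in RATES.items():
    print(f"{label}: {seconds_per_letter(rate):.3f} s per letter")
```

On these figures each imagined letter takes a little over half a second
when it must be responded to, against roughly a sixth of a second per
letter for implicit speech alone.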
Note

1. A few letters, such as "i," are ambiguous, but once specified by E,
there is no further difficulty.



EXPERIMENTS 4 AND 4': VISUAL IMAGERY FOR WORDS
(These experiments have now been submitted for publication.
Experiment 4' was not part of the original proposal
but has been added for completeness.)



Abstract



Two experiments were conducted to examine Hebb's test of the
"picture theory" of visually imagining words. Experiment 4
examined retrieval from visual image representations of words
by using a method of probing for the spatial properties of
the i-th letter in an imagined word. Evidence was obtained
for a limited capacity visual image system that can represent,
in a spatially parallel manner, three-letter words at least as
efficiently as they can be represented in a visual percept
system. Experiment 4' examined the growth of visual image
representations of words. All parts of a word did not grow
simultaneously, and image growth did not in general proceed
from left to right. The results of the two experiments
were interpreted as indicating the existence of a visual
image operating memory of between three and five letters
capacity, the contents of which can be examined in much the
same way as visual percepts.



Hebb (1966, 1968, 1972) and Woodworth and Schlosberg (1954)
describe at length what we might term the "picture theory" of visual
imagery. The picture theory consists of at least three facets. (1)
Subjective Experience. Many people report visual images of whole lines
or stanzas of verse, or at least of single words. The presumption
would also seem to be that these images are spatially parallel, that is,
all words or letters in the composite image are simultaneously available.
(2) Function. The use of visual image representations of lines of
text aids one's memory of the text. In the case of words, it may aid
in learning to spell a word or in recalling the correct spelling of a
word. (3) Description/Explanation. It seems that a mental picture
or image can appear in one part of the mind at which another part of
the mind can look.

Hebb proposes a test of the picture theory (1966, 1968, 1972).
Hebb's test seems to consist of at least the following propositions.

(1) It is a fact that when words are perceptually (visually) available
they can be spelled almost as fast backwards as they can forwards.

(2) To demonstrate the picture theory it would be necessary to show
the following. If there is actually a spatial visual image at which
one can look, the S ought to be able to spell the imagined word in a
backward direction nearly as fast as in a forward direction. Thus
after forming a clear image of a long word such as "university," he
should be able to spell it backward nearly as rapidly as forward.

(3) Hebb finds however that the backward spelling of the imagined word
is much slower than the forward spelling.

(4) He concludes that whatever the nature of the visual memory image,
it is not like having a picture in the mind at which another part of
the mind can look: "...the subjective impression that one can 'look at'
one's image freely is shown by objective test to be wrong" (Hebb, 1966,
pp. 43-44). Since it is possible to spell much more rapidly in a forward
than in a backward direction "...the fact that the person with visual
imagery can only 'see' the letters of a word in left-to-right order
shows clearly that the memory image ... is a series of events in a
particular order, not a picture whose parts could be looked at in any
order [or simultaneously]" (Hebb, 1966, p. 46; italics added).

Thus Hebb's test for the picture theory of visual imagery consists of a
general requirement that there exist analogs between perception and
imagery for the processing of spatial information and a particular
requirement that these analogs reveal themselves in the spelling test.

It is the contention of this paper that, with qualification, the
picture theory of visual imagery is at least partly correct and also that
Hebb's general test is a perfectly reasonable requirement. However,
we feel that Hebb's particular test, a comparison of forward and backward
spelling times, is not appropriate. In fact there are several problems
associated with the spelling test. (1) The spelling test is not
necessarily visual. If a S is asked to form a visual image of a word
and then spell it as rapidly as possible in a forward direction, his
performance may not be based on a visual representation at all. He
may simply draw on a highly practiced verbal/speech representation of
the word. Hence if Hebb's spelling test is essentially verbal in its
requirements, then we should not be surprised that for a long word it
is faster to spell in a forward than in a backward direction. After
all, we have years of learning and repeated practice in the forward
verbal spelling of long words and very little comparable training in
backward spelling. Unwinding a serial verbal habit is easier if we
begin at the initial step than if we try to go backwards. It is of
course possible that forward and backward spelling do not draw on the
same processes. While forward spelling is probably a verbal/speech
process, it is possible that backward spelling does indeed draw on the
visually imagined representation. But if this is the case, we are
simply comparing verbal sequencing with visual image sequencing and
finding that the verbal process is more rapid. This is not surprising
in view of the findings of Weber and Castleman (1970) that the letters
of the alphabet can be spoken either aloud or silently at a rate of
about six per second, while sequentially visualizing the same letters
occurs at a rate of about two per second.

With visual percepts, where the word is visually available, it
also should not be surprising that the letters of the word are spelled
almost as rapidly in a backward as in a forward direction. Any small
difference that occurs in favor of forward spelling may again mean that
the S is partly drawing on a verbal representation rather than on an
exclusively visual representation in perception. Thus there is nothing
to prevent him from looking at the word but basing its spelling on a
stored verbal sequence rather than the visually available letters.
However, if the perceptually displayed word is to be spelled backward,
there would not be a stored verbal representation. The S would then
rely on the visual string; and since all the visual information would
be simultaneously available, he could simply visually scan the word
from right to left. He could presumably do this at about the same rate
no matter how long the word was because he could simply move his eyes
from right to left. The capacity of the visual perception system
would be limited only to the number of letters in the perceptually
presented line.

(2) The visual image system may be very limited in capacity. Why
indeed does Hebb insist on a long word? If a short word (e.g., "toy")
is visually imagined, and there is uncertainty on the part of S as to
which direction he will be cued to spell the word, then it can readily
be spelled backwards about as rapidly as forwards. This could be
because the word is within the capacity of the visual image system for
spatially parallel (simultaneous) representation, whereas longer words
are not. That is, a longer word might have to be generated a syllable,
or a few letters, at a time in order not to exceed a limited capacity
visual image system. It might also be possible, however, to spell "toy"
as rapidly backwards as forwards on the basis of purely verbal processes.
For example, S might first spell the word in a forward direction and
then pull out the backward spelling from a short-term operating memory
of limited capacity. It is in fact difficult to decide between the
alternatives of visual versus verbal representation for either short
or long words. When S reports the letter names he could be getting
them from a visual image representation or from a verbally stored
code of the word that he has just recovered from saying it first in
a forward direction and then recovering the backward direction from
operating memory. Of course, the fact that S claims to have a visual
image of a word is not a guarantee that when he spells it he is basing
his spelling on the visual representation rather than a verbal
representation. Even a short word yields ambiguity when we try to decide
if it draws on verbal rather than visual processes. What is needed is a
way of knowing that S is drawing on a visual representation of a word.

If S had to respond to spatial features of a word, then we could
be more certain that the instructions to imagine the word and then
retrieve information from the imagined representation were being followed.
One solution would be to have S report on the visual properties of the
letters in a word (Weber & Castleman, 1970; Weber & Kelley, 1972; Weber,
Kelley, & Little, 1972). In particular, we might ask S to imagine
lowercase printed letters and to classify them for the spatial property of
vertical height. Here letters that are vertically large (b, d, f, g,
..., y) would fall in the YES category and all other letters would fall
in the NO category. This procedure allows for a set of converging
factors pointing to the conclusion that a visual image system rather
than a verbal system is involved. The converging factors are: (a)
instructional set, asking S to visualize and classify the letters in
a word for their spatial properties; (b) the ability of Ss to correctly
classify the imagined letters for vertical height; (c) frequent
subjective reports from Ss that they do visualize during the task; and (d)
the finding (Weber and Castleman, 1970) of highly distinctive rates of
processing for visual image versus implicit speech instructional sets.
Weber, Kelley and Little (1972, Experiment 2') have already compared
image representations for word and alphabet strings in which the height
property of each letter in the string was reported from left to right.
The results indicated different representations for the two types of
strings, with alphabetic representation sequential and word
representation at least partly parallel. However, even for words, the
letter-by-letter report may well have biased toward sequential
representation of letters. Thus, in the present experiments, a probe is
presented indicating a letter position; and the height of the letter
occupying the corresponding position in a test word is classified as
YES/NO by the Ss. The one word/one probe technique avoids building in
unnecessary sequential processing, minimizes response requirements, and
may thereby allow for maximum parallel representation of a visually
imagined word.
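
The height classification and the one word/one probe task can be
sketched as follows. This is only an illustration under stated
assumptions: the text lists the vertically large letters as "b, d, f, g,
..., y", so the full set used below is a guess at the intended letters,
and the function names are hypothetical.

```python
# ASSUMPTION: lowercase letters with an ascender or descender. The text
# gives only "b, d, f, g, ..., y"; ambiguous letters such as "i" would be
# fixed in advance by the experimenter and are treated as small here.
VERTICALLY_LARGE = set("bdfghjklpqty")


def classify_letter(letter: str) -> str:
    """Return YES for a vertically large lowercase letter, else NO."""
    return "YES" if letter.lower() in VERTICALLY_LARGE else "NO"


def height_spelling(word: str) -> list:
    """'Spell' a word by the height category of each letter."""
    return [classify_letter(ch) for ch in word]


def probe(word: str, position: int) -> str:
    """Classify the letter at a 1-indexed probe position."""
    if not 1 <= position <= len(word):
        raise ValueError("probe position outside the word")
    return classify_letter(word[position - 1])


print(height_spelling("cat"))   # letter-by-letter report
print(probe("toy", 2))          # single probe of the second letter
```

The probe form asks for one classification per word, which is the
property the text relies on to avoid building sequential report into
the task.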



Experiment 4: Word-Probe 

This experiment is concerned with comparing retrieval times from
an available image versus from an available perceptual representation.
Specifically, comparisons are made for image versus percept
representations of three and five letter words. If visual imagery is more
limited in capacity than visual perception, then we would expect
percept-image differences in response time to be smaller for three than
for five letter words; this should reveal itself as an interaction between
representation mode and word length. To the extent that processing
three and five letter words is within the capacity of both the visual
image and visual percept systems, and to the extent that these processes
overlap in shared components (Hebb, 1966, 1968, 1972; Neisser, 1972),
then the following should also be the case. Response times (RT) should
be similar in the sense that there would be no main effects for
percept-image representation or word length; and also there should be no
interaction between serial probe position and representation mode. In
line with this reasoning, it is possible to specify several possible
alternative hypotheses regarding serial position curves relating response
time and letter position probed.

(1) Forward Serial Hypothesis. Reaction time would be a linear
increasing function of the serial position probed. This would be
consistent with a letter-by-letter, left-to-right, self-terminating scan
of the visual image representation of the word. Whenever the probed
position was reached S would terminate the scan, abstract the height
property of the letter in the probed position, and respond YES or NO.
Since decision time and response time should not vary with the letter
position probed, the slope of the function would represent the time per
letter required to scan the image.

(2) Backward Serial Scan Hypothesis. Here RT would be a linear
decreasing function of the letter position probed. This would indicate
that S scans the image from right to left, i.e., in a backward direction.

(3) Random Access Hypothesis. Response time would be a flat
function of the letter position probed. This would occur because S can
"see" in a spatially parallel image all the letters at once. Insofar as
this image approximates a perceptually available word, the image and
percept functions should be comparable in form.

(4) Centerpoint Scan Hypothesis. Here S would form an image of
the word, and fix his attention on the centerpoint (middle letter) of
the word. Then if an initial letter position is probed S would scan
his image from the center to the left. If a terminal letter is
probed, S would scan his image from center to right. Under these
conditions the serial position function should dip in the middle and
rise toward the end positions, with an increasing number of letters
scanned reflecting increased RT.

(5) Centerpoint Interference Hypothesis. Here also S would
generate an image of the word and hold it in focal attention. However
the center of the word would yield a longer response time than the
ends. This could be because of (a) an information-seeking strategy in
which S habitually extracts information from the most informative parts
of a word first, or (b) a "perceptual" effect in which the end letters
suffer less visual interference, and therefore information is easier to
extract, since the end letters are bounded on one side only by other
letters (Woodworth & Schlosberg, 1954), and thus do not suffer the
same interference that would occur if letters were present on both
sides.

(6) Partial Word Hypothesis. In hypotheses (1) to (5) it has
been assumed that the whole image of the word is present at once.
However, if the size of the word exceeded the spatially parallel capacity
of the visual image system, S would necessarily produce the word a
segment at a time until he produced the segment in which the probed
letter appeared. The shape of the RT function might differ among
individual Ss, depending on what segment of the word was produced first
and which of the preceding strategies was used.
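
The qualitative shapes predicted by hypotheses (1) to (5) can be made
concrete with a short sketch. The base time and per-letter step are
hypothetical placeholders rather than estimates from the data, linear
costs are assumed for the two centerpoint hypotheses, and the partial
word hypothesis is left out because it predicts no single shape.

```python
def predicted_rt(hypothesis: str, n_letters: int,
                 base_ms: float = 1000, step_ms: float = 100) -> list:
    """Predicted RT (ms) at each probe position 1..n_letters.

    base_ms and step_ms are illustrative values only.
    """
    center = (n_letters + 1) / 2      # middle letter position
    max_dist = center - 1             # distance from center to either end
    # Number of "cost units" added at probe position p under each hypothesis.
    shapes = {
        "forward": lambda p: p - 1,
        "backward": lambda p: n_letters - p,
        "random_access": lambda p: 0,
        "centerpoint_scan": lambda p: abs(p - center),
        "centerpoint_interference": lambda p: max_dist - abs(p - center),
    }
    cost = shapes[hypothesis]
    return [base_ms + step_ms * cost(p) for p in range(1, n_letters + 1)]


for name in ("forward", "backward", "random_access",
             "centerpoint_scan", "centerpoint_interference"):
    print(f"{name:26s}", predicted_rt(name, 5))
```

Comparing an observed serial position function with these shapes is a
matter of asking whether it rises, falls, stays flat, dips in the
middle, or peaks in the middle.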

Method



Subjects.--Sixteen experimentally naive volunteer Ss with normal or
corrected vision were tested individually. Each S was previously
screened for the ability to respond to a visualized alphabet in a
manner similar to Weber and Castleman (1970).

Stimuli and Procedure.--The test stimuli were nouns and adjectives
with a Thorndike-Lorge count of five or more per 100,000 occurrences.
A total of 42 different words were used. Twelve were test words of
five letters in length and twelve were test words of three letters in
length. In addition, there were eight filler words three letters in
length (for statistical balancing purposes); and, finally, there were
five practice words for each word length. The twelve five-letter test
words were selected such that lowercase vertically large letters (b, d,
f, g, h, j, ..., y) and vertically small letters (a, c, e, ..., z)
appeared equally in all five letter positions (first through fifth).
The twelve three-letter test words and eight three-letter filler words
were also selected so that vertically large and small letters appeared
in all three letter positions.

Each S was given a preliminary spelling test for the words that
were to be used. He was required to "spell" the words both with letters
and also with YES and NO indicating the vertical height of the letters.
For example, the word "cat" would be spelled "c, a, t," and "NO, NO,
YES," respectively.

In the experiment the stimulus words were presented as images or
as percepts. For the image conditions the E spoke the word and S was
instructed to form a visual image of it with his eyes closed. Imagining
something visually was explained to be like "picturing it in the
mind." For the percept condition each word was rear projected with a
Kodak Carousel projector equipped with a solenoid operated shutter.
Letters appeared in elite type, lowercase form, on a 30.48 cm x 20.32
cm rear projection screen. The vertically small letters appeared 0.63
cm in height. Since vertically large letters were slightly irregular in
height, the "j" is taken as the standard; and it was 1.27 cm in projected
height. The S viewed the letters from a distance of 53.34 cm.
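
For readers who prefer the display described in visual angle, the
letter heights and viewing distance above can be converted with the
standard formula angle = 2 * atan(size / (2 * distance)). The report
itself does not give visual angles, so the following is only a derived
illustration; the function name is hypothetical.

```python
import math


def visual_angle_deg(size_cm: float, distance_cm: float) -> float:
    """Visual angle (degrees) subtended by an object of a given size."""
    return math.degrees(2 * math.atan(size_cm / (2 * distance_cm)))


VIEWING_DISTANCE_CM = 53.34   # from the text
SMALL_LETTER_CM = 0.63        # vertically small letters, from the text
LARGE_LETTER_CM = 1.27        # the standard "j", from the text

small = visual_angle_deg(SMALL_LETTER_CM, VIEWING_DISTANCE_CM)
large = visual_angle_deg(LARGE_LETTER_CM, VIEWING_DISTANCE_CM)
print(f"small letters: {small:.2f} deg, large letters: {large:.2f} deg")
```

On these figures the small letters subtend roughly 0.7 degrees and the
large letters roughly 1.4 degrees.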

Four seconds after presentation of a stimulus word the E orally
presented a probe digit ("one," "two," ..., "five"). The probe digit
indicated a letter position in the word the S was imagining or
perceiving. In a word like "cat" the digit "one" referred to the "c,"
"two" referred to the "a," and "three" referred to the "t," and similarly
for the five-letter words. The spoken digit activated a Lafayette
voice relay and started a Standard clock. The S then responded YES or
NO to indicate whether the letter denoted by the probe digit was
vertically large or not. The S's response stopped the clock, and RT from
onset of the probe to onset of the response was recorded. For the
percept conditions, after S responded the slide terminated and a blank
adaptation field was displayed from another projector with a solenoid
shutter. All Ss were instructed to respond as rapidly as possible,
with 100 percent accuracy. Any words which resulted in error were
repeated at the end of the run. Hence all time scores were for correct
choices only.

Design.--There were two levels of word length (three-letter and
five-letter words) and two modes of representation (percept and image).
Word length was a between-Ss variable and representation mode was
within-Ss, with half of the Ss proceeding in the order image-percept and
the other half in the order percept-image. Within each of the between-Ss
conditions there were 60 trials (each test word appeared five times
in the five-letter treatment and three times in the three-letter
treatment). This allowed each letter position in each word to be tested
once and only once; the additional three-letter filler words made it
possible to equate practice on three- and five-letter words. Probe
position was also treated as a separate within-Ss variable for each
word length. Two different random orders of words were used.
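
The trial counts in the design can be checked by enumerating every
word-by-position pairing. The sketch below uses integer word indices as
stand-ins, since the actual word lists are not given here; the function
name is hypothetical.

```python
from itertools import product


def build_trials(n_words: int, word_length: int) -> list:
    """Pair every word with every letter position exactly once."""
    return list(product(range(n_words), range(1, word_length + 1)))


# Five-letter treatment: 12 test words, each probed at 5 positions.
five_letter_trials = build_trials(12, 5)

# Three-letter treatment: 12 test words plus 8 fillers, 3 positions each.
three_letter_trials = build_trials(12 + 8, 3)

print(len(five_letter_trials), len(three_letter_trials))
```

Both treatments come to 60 trials, which is how the eight filler words
equate practice across the two word lengths.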

Practiced S.--In addition to the foregoing group conditions, the
identical procedures were replicated with a single S highly practiced in
visual image tasks dealing with alphabet and word strings similar to
those described in Weber and Castleman (1970) and Weber, Kelley, and
Little (1972). This practice was of an informal sort and cannot be
quantitatively described, but it occurred over a period of several
months on a fairly intensive basis.



Results and Discussion



Figure 1 shows the principal group results in the left panel.
Each S's median response time for a given condition was computed and
then the mean of the medians obtained for Fig. 1. Thus each point
represents 96 events (12 words x 8 Ss). Medians were used because of
the substantial variability encountered. The number to the right of
each function is the mean RT obtained by collapsing across probe
positions. The right panel is the same data for the practiced S but
is based on medians only, 12 events (words) per point.
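
The scoring rule just described, a median within each S followed by a
mean across Ss, can be sketched as follows. The response times are
invented solely to show why medians damp the effect of occasional very
long RTs; none of the numbers come from the experiment.

```python
from statistics import mean, median


def mean_of_medians(rts_by_subject: dict) -> float:
    """Median RT within each subject, then the mean across subjects."""
    return mean(median(rts) for rts in rts_by_subject.values())


# HYPOTHETICAL response times in ms for one condition; each S has one
# unusually slow trial.
example = {
    "S1": [900, 950, 1400],
    "S2": [800, 820, 2000],
}

all_rts = [rt for rts in example.values() for rt in rts]
print("mean of medians:", mean_of_medians(example))
print("mean of all RTs:", mean(all_rts))
```

With these made-up values the mean of medians is 885 ms while the plain
mean of all trials is 1145 ms, which illustrates how the slow trials
inflate the plain mean but not the median-based score.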

Several statistical analyses were conducted. In each case RT was
the dependent variable. In the first analysis of variance, data was
collapsed across probe positions. That is, each S's score was the
mean of his separate median probe position RTs. The resulting two-way
classification of two word lengths (three, five) by two representation
modes (image, percept) was analyzed with the first factor between-Ss
and the second within-Ss. For word length, F (1, 14) = 5.20, p < .05;
for representation mode, F (1, 14) = .48, p > .05; and for the Word
Length by Representation interaction, F (1, 14) = .82, p > .05. Thus
RT significantly increases with word length, but images and percepts
are not reliably different in how rapidly spatial information may be
extracted from them. Indeed retrieval time from the three-letter image
condition is at least as fast as from the three-letter percept. The
data for the practiced S collapsed across probe positions yields results
somewhat comparable. The five-letter image condition requires the
longest mean retrieval time, and the three-letter image and percept
conditions are very comparable. The absolute times are generally less
for the practiced S. No statistical analysis was possible, since only
one S was involved.
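
The significance statements in this analysis can be checked against
tabled critical values of F at the .05 level. The critical values below
are taken from standard F tables and are not given in the report; the
helper name is hypothetical, and the F ratios are those reported for the
collapsed analysis.

```python
# Upper 5 percent points of the F distribution from standard tables,
# keyed by (numerator df, denominator df). Not part of the report.
F_CRIT_05 = {(1, 7): 5.59, (1, 14): 4.60, (2, 14): 3.74, (4, 28): 2.71}


def is_significant(f_ratio: float, df_num: int, df_den: int) -> bool:
    """True when an observed F exceeds the tabled .05 critical value."""
    return f_ratio > F_CRIT_05[(df_num, df_den)]


# F ratios reported for the analysis collapsed across probe positions.
reported = [
    ("word length", 5.20, 1, 14),
    ("representation mode", 0.48, 1, 14),
    ("word length x representation", 0.82, 1, 14),
]

for effect, f_ratio, df_num, df_den in reported:
    verdict = "p < .05" if is_significant(f_ratio, df_num, df_den) else "p > .05"
    print(f"{effect}: F({df_num}, {df_den}) = {f_ratio:.2f}, {verdict}")
```

Only the word length effect exceeds its critical value, in agreement
with the conclusions stated in the text.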


The second series of analyses examined probe position effects.
Consider the three-letter words. Here the analysis was based on a
two-way classification of three probe positions (1, 2, 3) by two
representations (image, percept), with both factors within-Ss. For
probe position, F (2, 14) = 7.64, p < .05; for representation mode,
F (1, 7) = .94, p > .05; and for the probe position by representation
interaction, F (2, 14) = .09, p > .05. Thus the dip in RT for the
middle letter is significant, and this would be consistent with the
centerpoint scan hypothesis. To examine more closely the probe position
effect, separate Newman-Keuls tests were performed for the three-letter
percept and three-letter image conditions. The results are indicated
in Fig. 1 by the letters adjacent to each point. Those points with
a letter in common do not differ significantly, while those points
without a letter in common do differ significantly (p < .05). Hence the
three-letter percept condition may be regarded as statistically flat,
a result consistent with the random access hypothesis. But the
three-letter image condition indicates a significantly faster retrieval
time when the second letter position of a word is probed. The probe
position effect for three-letter words from the practiced S suggests a
right to left backward scan of the word for both percept and image
conditions. Evidently the strategies used [illegible] vary among Ss.

In a comparable series of analyses for the five-letter word
conditions, the following results were obtained. For probe position,
F (4, 28) = 4.97, p < .05; for representation, F (1, 7) = 1.88, p > .05;
and for the Position by Representation interaction, F (4, 28) = 2.28,
p > .05. Not all parts of a word are equally available as shown by the
significant probe position effect. And representation is not significant,
possibly because of the substantial variability. The Newman-Keuls test
results are again summarized in Fig. 1. The five-letter percept
function is statistically flat, a finding consistent with the random
access hypothesis. However the five-letter image function shows a
maximum RT for the fourth probe position, and that fourth position RT is
significantly greater than for the first and second positions. The
first, second, third, and fifth probe positions are statistically the
same, since they are all covered by the common letter a. Positions three,
four, and five are also statistically the same, since they are covered by
the common letter b. These results seem most consistent with the partial
word hypothesis in which a fragment at a time of the word is generated
into visual operating memory. The practiced S data for five-letter words
follows a somewhat different pattern for the image conditions but seems
reasonably flat for the percept conditions.

It is appropriate to note that preliminary analyses were also
performed in which YES/NO responses were treated as a separate factor. No
consistent effects distinguishable from the present results were
apparent.

To summarize, there are differences between three- and five-letter
words both in absolute RT and in the pattern of probe position effects.
For image versus percept representation there do not seem to be
significant differences in absolute RT, but there are differences in probe
position effects. Finally, none of the hypothesized scan modes seem to
account for very much of the data from the image conditions. However,
for the percept conditions a random access hypothesis is supported.



Experiment 4': Probe-Word



* Tlie hypotheses of Experiment 4 have bearing on the retrieval of 
spatial information from an imag§ that'' is all or partly available at the 
time the probe is presented. However it is possible that the g rowth 
of generation of an image is distinguishable "from retrieval from an 
available image. For example. Experiment 4 T^ndicated that Ss do not 
retrieve the visual property of the 1-th letter .by scanning the 
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co5iplo<:Gd image from left to right. But, it is still possible that 
the image is generated [left to right in a sequential fashion. This is 
clearlj^ the caSo with alj^habetic sequences x.'hich require verbal -sequencing 
between letters (Vrehei: jKelley, & little, 1972), ^nius with an alt^habt-^tic ' 
list £ s eems to gay oa c ^ ]ir>H'nv ^.^^^y^ .. ^rnrr: 



In con- 



Tirast, with words it isjnot the case that S savs the successive letters 
in order to genorace a jisual representation of its letters (Weber, ' 
et al., 1972). Yet if the woid is presented l>raily, it inight still i( 



. I'* 



-I: 



proc<4SScd as it comes. 
We Ginphasij:e the ineye j 
the probe ^Kl,J±UHn_J:iie \. 



lat is, £ro-.t left to right. In Expe-riment 4' 
eneration or ^$ro<.'th question by first present 
ord that' is tS ipaginod. Instead oi un 



»Xbe v:'.'rd uncertainty is, 
uncertainty. 

Mc'tl^od 



c;,ertaiuty'^r probe position (Expcrinent 4) A^e noxc have word uncertaint y- 



of couirse^ greater than the probe position 



Sixte^en cxperiTnontallly naive 'Ss with 
p^id for' tiieir participation and were test 
materials and procedures were identical to 
except that in this, case ^he- probe di^it w 
v^s followed by the stimulus word. The hi 
Experiment ^ was also teslled, but on diff 
4. lu.^ 



Results ^and Discus s ion 



iiormal or corrected vision we«fe 
id individually. All stiraulus 
those of Experiment 4 * 
ys presented first and then > 
i(hly practiced 5 from 
ent days than for Experiment 



Fig. 2. Tlie treatment of 
Experiment 4. \fnen the 
resulting two-v7ay analysis . 
i^ rates that word length is 
It representation mode is • 
md that the Word Length 
I (1, 14) - 9.72, £ < ..05. 
gnificant in Experiipent 4^ the 



,Tlie principal results are summarised 
•fl^e .results* is. exactly parallel to that jc 
probe' ]jo*jiJ?ion variable is collapsed, thl 
of woi-d* Iqicigth by representation mode incii< 
significant, F (1, 14) =^ 7.69, £^ < .05; ^b 
significant; F (i, 14) « 249.58, £ < .05 
Representation interaction is significant, 
While the feord length variable was also^:5i 

representation mode variable shov;s a subjJtJantial effect here in con- 
.trast to the first experim.ont. This is jJobably not too surprising; 
sinc^ the image representations must be ginerated before information 
from the ima^e can be" retrieved. In contrast, the percept conditions 
require no cpmparable generation before the abstraction of ^infgrmation 
can begin. jSince the presbnt i'mSge versus. percept comparison is most 
•comparable to llebb^s (1966) 'spelling tes*^, it is**to be expected that it * 
.takes longer to generate and retViovei spdtial properties of> images than 
it does to sim|)ly retrieve the. spatial properties of aji^ already available 
image "or percept.** Tlie signiSicant Wo'fd length by Representation 
interaction is indicative of the limited/ jcapacity of the visual image' 
system. As the worJ to be geijerated becjo.mes longer and exceeds pro- 
cessing capacity, RT becomes disproportidnally longer for the image* 
conditions in comparison to the percjept jconditiops. 
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Separate serial po»iition analyses \v*erb* perfornied ^cr each vord 
. lengths For Uhree-lcttor voxds an anciysis of Probe Posiition by 

^ ^" I2rn?57 E< -05; a sigiu£i^hc rc-prosoncation effect^ T (1, 7) = 3^30.73, 

2< ,05; and no siLoii f leant interact ion,. -F (2, 14> = 2.^)3, £> ,05, Tiic 
Newan-Kculs cests 'for 'chrec-ler.ter •^-^ard conJitioas iinlicate^ that th.^ 
percept function is statiitieally •£ lat and that for tb\ h^nyo function " 
each point is sfaisticaWy different ' from the olhoi tv:o^, Tixa incrcas- 
i • • ing three-lettpr in4:.p,e function is consLstsnL xcith a lei't-to-riAnt 

f - - generr*Liou of the imago- The practiced S'^jf thrtc-lettr.r ia*.\<;e data is, 

\ however, quite flat anQ'is cKvi&istont vitii the hyrpthe;3is of r^^ndoni ' 

^cceLs to an iniDfy that ^*,s comp],ctely and ;>jLniiiitancbusly avaii^xblo, 

The serial position analyses for five-letter words indicate the
following. For the Probe Position by Representation analysis, position
is significant, F (4, 28) = 26.76, p < .05; representation is
significant, F (1, 7) = 99.06, p < .05; and the Position by
Representation interaction is significant, F (4, 28) = 13.77, p < .05.
The Newman-Keuls test indicated a statistically flat percept function.
The image function, however, has statistically comparable values for
probe positions one and two, comparable values for probe positions
three and five, and a maximum value at position four. This is not
consistent with left-to-right image generation. As in Experiment 4 the
pattern of generation is suggestive of the partial word hypothesis,
with positions one and two generated first and concurrently, followed
by generation of the remaining positions, when necessary. The data for
the practiced S indicates a much flatter five-letter image function.
His absolute RT is also much less than that of the group data.

In summary, as in Experiment 4 percept functions remain statistically
flat even when the probe is presented prior to the word. Generation
processes of the type studied here are minimal in perception, and
random access to visually available letters seems to be the result.
However, image functions show a sharp increase from left to right. But
it would be wrong to conclude from this that images are generated from
left to right, because the increase is not monotonic for five-letter
words and the data from the practiced S indicates reasonably flat image
functions. Finally, this experiment shows a substantial time difference
between percept and image conditions that was not seen in Experiment 4.
It takes time to generate images (Weber & Castleman, 1970).

Conclusion

A modified version of the Hebb test of the picture theory of imagery
has revealed that when long (five-letter) words are "spelled" for the
spatial properties of their letters, there may indeed be some
differences between the image and percept systems. But if the amount of
information in the visual image system is small (three letters), so
that the visual image capacity is not exceeded, then spatial
information can be retrieved about as rapidly as when the word is
presented to the visual perceptual system.

Our explanation for this is that the S is able to output from long
term visual memory into a limited capacity operating memory (focal
attention) a spatial representation of a word. Once the spatial
features of a word are visually represented in operating memory, they
can then be examined by other information processing routines. These
routines are capable of examining the spatial properties of a letter in
a given probe position, and an appropriate YES/NO decision can be made.
In this sense, then, "one part of the mind can examine the activity of
another part of the mind," contrary to Hebb's conclusion (1966). Thus
Hebb's general criterion for visual imagery of words is perfectly
appropriate, but his backward and forward spelling test for
perceptually available versus imagined words is not appropriate for two
reasons. First, as usually applied, the test advocates the use of long
words, and any limited capacity system such as the visual imagery mode
would be immediately overloaded. Second, imagery need not be involved
at all for some spelling conditions, since S could draw on various
verbal processing routines to spell the word rather than on any visual
image representation he might have.

While the picture theory of visual imagery may be more correct than
psychologists have been willing to grant, there are still some obvious
differences between imagery and perception. Those differences involve
at least the following: capacity and sequencing considerations, serial
position effects, practice effects, and subjective stability.

The present results suggest that the capacity of the visual image
system for simultaneous letter representation is between three and
five letters for high frequency words. Experiment 4 reveals that the
image system does reasonably well with three-letter words, but it
begins to show signs of strain (departures from flatness and
percept-image differences) with five-letter words. Experiment 5
indicates that the manner in which the visual image capacity is filled
may be consistent with left-to-right generation into visual operating
memory for the three-letter image conditions but not for the
five-letter conditions. The data for the practiced S is even less
consistent with a left-to-right visual image growth.

It is of particular interest to know how the capacity limitations
of the visual imagery system are handled. Weber, Kelley and Little
(1972) and Weber and Kelley (1972) have suggested that the limited
capacity of the visual image system creates a sequencing problem. That
is, when a spatial array of letters to be imagined exceeds the capacity
of the visual image system for spatially parallel representation, it
becomes necessary to generate the array into the visual imagery system
a piece at a time. When this is done for alphabetic lists or long
words ("supercalifragilisticexpialidocious"), it seems that a verbal
sequencing operation is employed in the following way. The sequential
ordering for such a long letter list is verbally encoded. To represent
this order in the limited capacity visual image system, the S says each
successive letter name or syllable to himself prior to visually
imagining it (Weber, Kelley, & Little, 1972). This is in sharp contrast
to perceptual presentation where the S need only move his eyes from one
letter or syllable to another (Weber & Kelley, 1972) in order to
represent a letter prior to abstracting its spatial properties. Thus
the capacity of the visual perceptual system is unlimited as long as
the time requirements are not so stringent as to limit visual scanning
or eye movement from one letter locus to another.

For both experiments, retrieval from percepts as a function of
position probed looks like a random access process. But there does not
seem to be any great consistency in retrieval from or generation of
images that makes sense in line with the various hypotheses described
in Experiment 4. Also, in both experiments the RTs of the practiced S
are generally faster than those of the naive group, but differentially
so for image and percept conditions. This could be due to individual
differences, but we believe (without strong evidence) that it means
that visual image generation and retrieval can profit from extensive
practice. The limits of such improvement would be of interest in their
own right but would require more systematic study of the [illegible]
over time.



Finally, there are undoubtedly image-percept differences in
stability of representation. Images seem to fade in and out while
percepts are relatively stable. The time course of image evanescence
would also be of interest.

CONCLUSIONS



A series of six experiments has been conducted which demonstrate a
number of phenomena related to the visual image representation of
letters and words. Those phenomena include the following:

(1) Sequencing between images of letters in an unpronounceable
string like the alphabet is under verbal control: the subject encodes
and decodes the sequence verbally; his generation of a visual image
then depends on his immediately prior verbal activation of the name of
the letter. Hence, this is a sequential, one-letter-at-a-time process.

(2) Sequencing between visually imagined letters in a pronounceable
string, such as a short word, is not under verbal control. This is
because the entire word can be represented simultaneously in visual
imagination and there is no corresponding need to verbally generate it
a letter at a time.



(3) The capacity of the visual image system for simultaneous
representation of letters is very limited: approximately between 3-5
letters for short easily pronounceable letter strings (words).

(4) Visual percept and visual image representations of words are
very comparable for three-letter words but with longer words the
capacity of the imagery system for simultaneous letter representation
begins to be exceeded. When this occurs, a sequencing problem arises:
how to get from an initial letter (or set of letters) to the next
letter(s). This is done differentially for the two systems, by implicit
verbal control for the image system and by simple eye movement or
scanning for the percept system.

The capabilities of the visual image system for representing letters
and words suggest that the teaching of spelling might profitably employ
visual as well as verbal encoding. For example, the word "child" when
"spelled" in terms of its visual properties becomes "NO, YES, NO, YES,
YES." If a student represents a word with the appropriate sequence of
yes/no, we can be reasonably sure that he has a visual encoding of
the word. The extent to which visual encoding and rehearsal might aid
spelling warrants follow up.
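
The visual "spelling" of a word can be illustrated with a minimal
sketch. It assumes that the classified property is vertical height and
that the vertically large lowercase letters are those with ascenders or
descenders. That letter set, and the names TALL_LETTERS,
spell_by_height, and probe, are illustrative assumptions, chosen only
so that the "child" example above is reproduced.

```python
# Assumption: "vertically large" lowercase letters are those with an
# ascender or descender. The source gives only the "child" example;
# this full set is illustrative, not taken from the report.
TALL_LETTERS = set("bdfghjklpqty")


def spell_by_height(word):
    """Return one YES/NO judgment per letter (YES = vertically large)."""
    return ["YES" if ch in TALL_LETTERS else "NO" for ch in word.lower()]


def probe(word, position):
    """Answer a single probe; position is 1-indexed (probe position one, two, ...)."""
    return spell_by_height(word)[position - 1]


print(", ".join(spell_by_height("child")))  # NO, YES, NO, YES, YES
```

A student's yes/no sequence for a word could then be compared with the
output of spell_by_height to check for a visual encoding of that word.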
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